Segmentation of Child-Directed Speech: A Statistical Approach

Author

  • Natalya Muzinich
Abstract

This paper describes how distinctive features that classify speech sounds emerge from a statistical analysis of Russian child-directed speech. The analysis is based on a transcriptional representation of individual speech sounds. Analysis of the bigram distribution yields the major natural classes, consonants and vowels, which are further subdivided into non-palatalized versus palatalized consonants and front versus non-front vowels. The results, presented as a probabilistic finite-state automaton (FSA), exhibit strong associations between the uncovered subclasses of consonants and vowels, and these associations are supported by traditional linguistic analysis.
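
As a rough illustration of the kind of bigram statistics the abstract refers to, the sketch below estimates transition probabilities P(next | current) between adjacent speech-sound symbols, which is the sort of quantity a probabilistic FSA would carry on its arcs. It is not the paper's procedure: the function name `bigram_transition_probs` and the toy transcriptions are hypothetical, and the paper's actual method for deriving classes from these counts is not given here.

```python
from collections import Counter, defaultdict


def bigram_transition_probs(utterances):
    """Estimate P(next | current) from adjacent symbol pairs.

    `utterances` is an iterable of symbol sequences, one symbol per
    speech sound. Using sequences (not raw strings) allows
    multi-character symbols, e.g. for palatalized consonants.
    Pairs are counted within an utterance only, never across utterances.
    """
    pair_counts = Counter()
    for symbols in utterances:
        pair_counts.update(zip(symbols, symbols[1:]))

    # Total number of bigrams that start with each symbol.
    totals = Counter()
    for (first, _), n in pair_counts.items():
        totals[first] += n

    # Normalize the counts into conditional probabilities.
    probs = defaultdict(dict)
    for (first, second), n in pair_counts.items():
        probs[first][second] = n / totals[first]
    return dict(probs)


# Hypothetical toy transcriptions; NOT data from the paper.
toy = [list("mama"), list("papa"), list("mapa")]
probs = bigram_transition_probs(toy)

for current in sorted(probs):
    for nxt in sorted(probs[current]):
        print(f"P({nxt} | {current}) = {probs[current][nxt]:.2f}")
```

In this toy input every consonant is followed by a vowel, so the rows for `m` and `p` put all their probability on `a`. Regularities of this kind are what a distributional analysis could, in principle, exploit to separate classes of sounds.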

Related articles

Statistical Speech Segmentation and Word Learning in Parallel: Scaffolding from Child-Directed Speech

In order to acquire their native languages, children must learn richly structured systems with regularities at multiple levels. While structure at different levels could be learned serially, e.g., speech segmentation coming before word-object mapping, redundancies across levels make parallel learning more efficient. For instance, a series of syllables is likely to be a word not only because of ...

A statistical model for word discovery in child directed speech

A statistical model for segmentation and word discovery in child directed speech is presented. An incremental unsupervised learning algorithm to infer word boundaries based on this model is described and results of empirical tests showing that the algorithm is competitive with other models that have been used for similar tasks are also presented.

How Ideal Are We? Incorporating Human Limitations into Bayesian Models of Word Segmentation

1. Introduction Word segmentation is one of the first problems infants must solve during language acquisition, where words must be identified in fluent speech. A number of weak cues to word boundaries are present in fluent speech, and there is evidence that infants are able to use many of these, including phonotactics However, with the exception of the last cue, all these cues are language-depe...

Input and uptake at months predicts toddler vocabulary: the role of child-directed speech and infant processing skills in language development

Both the input directed to the child, and the child’s ability to process that input, are likely to impact the child’s language acquisition. We explore how these factors inter-relate by tracking the relationships among: (a) lexical properties of maternal child-directed speech to prelinguistic (month-old) infants (N= ); (b) these infants’ abilities to segment lexical targets from conversation...

Co-occurrence statistics as a language-dependent cue for speech segmentation.

To what extent can language acquisition be explained in terms of different associative learning mechanisms? It has been hypothesized that distributional regularities in spoken languages are strong enough to elicit statistical learning about dependencies among speech units. Distributional regularities could be a useful cue for word learning even without rich language-specific knowledge. However,...

Publication date: 2006